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Abstract —Quality of service (QoS) for a network is charac¬ 
terized in terms of various parameters specifying packet delay 
and loss tolerance requirements for the application. The nnpre- 
dictahle nature of the wireless channel demands for application of 
certain mechanisms to meet the QoS requirements. Traditionally, 
medium access control (MAC) and network layers perform these 
tasks. However, these mechanisms do not take (fading) channel 
conditions into account. In this paper, we investigate the problem 
using cross layer techniques where information flow and joint 
optimization of higher and physical layer is permitted. We 
propose a schednling scheme to optimize the energy consnmption 
of a multiuser multi-access system such that QoS constraints 
in terms of packet loss are fulfilled while the system is able 
to maximize the advantages emerging from multiuser diversity. 
Specifically, this work focuses on modeling and analyzing the 
effects of packet buffering capabilities of the transmitter on the 
system energy for a packet loss tolerant application. We discuss 
low complexity schemes which show comparable performance to 
the proposed scheme. The numerical evaluation reveals useful 
insights about the coupling effects of different QoS parameters 
on the system energy consumption and validates our analytical 
results. 

Index Terms —Multiuser diversity, green commnnications, ra¬ 
dio resource allocation, opportunistic scheduling, Markov chain, 
packet loss-energy trade-off, stochastic optimization. 

I. Introduction 

The QoS parameters like throughput, latency, packet loss 
rate etc., characterize the behavior of network traffic. Specif¬ 
ically, there are some strict hard requirements in terms of 
worst case behavior for multimedia traffic like minimum 
throughput and maximum tolerable packet delay, which need 
to be fulfilled to maintain the quality of experience (QoE) of 
the application. At the same time, system energy efficiency 
has emerged as one of the key performance indicators for 
the wireless network 0-0. For the system design, QoS 
parameters can be treated as degrees of freedom (DoF) to 
achieve high system level energy efficiency. If the application 
is loss and delay tolerant, the DoFs can be exploited to 
maximize the system energy efficiency. 
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In the literature, energy efficient scheduling has been dis¬ 
cussed in different settings for delay limited systems B-M- 
The authors in El propose a scheme which schedules the 
transmission of multimedia packets in such a way that all the 
users have a fair share of packet loss according to their QoS 
requirements, and maximizes the number of the served users 
under the QoS constraints. The author in ITOl addresses the 
importance of packet dropping mechanisms by energy point of 
view. Traditionally, average packet drop rate is considered to 
be one of the most important parameters for system design 
HD, CD. However, QoF for the application, specifically 
multimedia streaming, depends on the other characteristics of 
packet dropping. Average packet drop parameter characterizes 
the behavior of the application on long term basis only. In 
multimedia applications, short term behavior dictates the QoF. 
For example, consider a scenario where the average packet 
drop rate Star is quite small but a large number of packets 
are dropped successively due to the deeply faded wireless 
channel (called bursty packet loss). In spite of fulfilling an 
average packet drop rate guarantee (on long term basis), the 
users will experience a jitter in the perceived QoF (for a 
multimedia application). Thus, QoS must also be defined in 
terms of maximum number of packets allowed to be dropped 
successively in addition to the average packet drop probability. 
This additional parameter characterizing the pattern of the 
dropped packets is termed continuity constraint parameter 
N ini. Packet scheduling constrained by average packet 
drop rate and maximum successive packet drop belongs to 
a class of sequential resource allocation problems, known as 
Restless Multiarmed Bandit Processes (RMBPs) d. This 
problem has been addressed for Asynchronous Transfer Mode 
(ATM) networks in ifTSl . The authors in ifThI discuss a similar 
problem and an optimal dropping scheme with the objective 
to minimize/maximize the packet drop gap is proposed. A 
useful analytical framework is discussed in CD to dimension 
the packet loss burstiness over generic wireless channels and 
a new metric to characterize the packet loss burstiness is 
proposed. 

Traditionally, such problems are handled at upper layers of 
communications through link adaptation or automatic repeat 
request (ARQ) mechanisms. However, bringing this informa¬ 
tion to the physical layer design shows significant merits as the 
information can effectively be used for opportunistic schedul¬ 
ing purposes. The work in O proposes an opportunistic 
scheduling scheme which exploits the DoF available through 
continuity constraint and average packet drop parameters and 
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aims at minimizing the average system energy. The work 
characterizes the effects of the 0 tar and N on the system 
energy consumption. However, the proposed scheme does not 
allow buffering of data packets which is an integral part of the 
resource allocation mechanisms. 

This work generalizes the framework in ifTSll for the case 
when buffering of packets is allowed for a finite number of 
time slots on the transmitter side. This additional DoF poses 
new challenges in terms of modeling and analysis of the 
problem because a buffer provides multiple opportunities to 
exploit multiuser diversity as compared to a single opportunity 
in real time traffic. In addition to the QoS parameters Star and 
N, the size of the buffer B provides another trade-off for 
energy efficiency. It should be noted that in contrast to the 
conventional system design goal of dropping of data packet 
as a consequence of not being able to provide required rate 
to the users, our approach encourages dropping of packets to 
optimize the system energy consumption as long as the QoS 
parameters allow. 

We investigate the energy efficiency of the system con¬ 
strained by the coupled packet drop (QoS) parameters. The 
main contributions of this work are as follows: 

• We allow buffering at transmitter side which allows 
multi-packet scheduling as compared to a single packet 
scheduling in ifTsl . The buffering effect to exploit channel 
diversity has been well studied in literature, but average 
and successive packet (bursty) loss constraints have not 
been investigated simultaneously over fading channels for 
the buffering system. The modeling of successive packet 
loss constraint requires not only to decide how many 
(average) packets need to be transmitted, but which of 
them are more significant with respect to QoE. 

• We propose a novel scheduling scheme which takes into 
account channel distribution, packet loss characteristics 
and maximum delay limitations for a packet. This gen¬ 
eralized framework is more complex due to involvement 
of an additional DoF, but provides better results in terms 
of energy efficiency as demonstrated through asymptotic 
case analysis and numerical evaluation. 

• We investigate and quantify the effect of buffer size on 
system energy mathematically and characterize the dom¬ 
inating regions for each system parameter (e.g., buffer 
size, Otar, N) in terms of energy efficiency. We show 
that increasing buffer size indefinitely does not help to 
increase energy efficiency of the system for a fixed N 
and Otar- 

• The complexity of the proposed scheme is quite high for 
large buffer size. Therefore, we propose and analyze the 
low complexity solutions. The energy loss due to sub¬ 
optimality is evaluated numerically, which reveals the 
interesting result that the optimal and low complexity 
schemes show comparable energy performance. 

The rest of the paper is organized as follows. Section [II] 
introduces the system model and fundamental assumptions. 
We discuss and analyze the proposed scheme in Section HII] 
while the optimization problem is formulated in Section |IV] 
The effect of buffer size on system energy is characterized 


mathematically in Section |V] We discuss low complexity 
schemes in Section |VT] and compare them with our proposed 
scheme. We provide numerical evidence of the tradeoff be¬ 
tween energy, data loss and buffer size for our schemes in 
Section IVIII and conclude with the main contributions of the 
work in Section IVIIII 

II. System Model 

In this paper, we follow the system model used in ifTSl . ifTSll . 
We consider a multiple-access system with K users randomly 
placed within a certain area. The system is able to provide 
a certain fraction of the total data rate to each user. Every 
scheduled packet for a user k has normalized size Rk = % 
where C denotes the spectral efficiency of the system. 

We consider an uplink scenario where time is slotted such 
that each user k experiences a channel gain hk{t) in a time 
slot t. The channel gain hk{t) is the product of path loss 
Sk and small-scale fading /fc(f). The path loss is a function 
of the distance between the transmitter and the receiver and 
remains constant within the time scale considered in this work. 
Small-scale fading depends on the scattering environment. It 
changes from slot to slot for every user and is independent and 
identically distributed (i.i.d.) across both users and slots, but 
remains constant during the time span of a single time slot. 
The multi-access channel is described by the input vector (X) 
and output vector (Y) relation as, 

K 

Y{t) = Y,t/}W)X{t) + Z{t) (1) 

k=l 

where Z represents additive i.i.d. complex Gaussian random 
variable with zero mean and unit variance. The channel state 
information (CSI) is assumed to be known at both transmitter 
and receiver sides. 

The continuity constraint requires us to allow scheduling 
of multiple users simultaneously in the same time slot. If 
only a single user is scheduled per time slot, the continuity 
constraint cannot be satisfied without allowing outage when 
multiple users have already dropped N packets. The analysis 
of the scheme is based on asymptotic user case and therefore, 
scheduling of very large number of simultaneous users is desir¬ 
able. We use superposition coding and successive interference 
cancelation (SIC) mechanism for successful transmission of 
data (rate) of simultaneously scheduled userfl Treating the 
other users as interference, we end up with a Gaussian channel 
for which modulation and coding schemes are well researched, 

e-g-, HU- 

Let K. denote the set of users to be scheduled and 
$1... $1^1 be the permutation of the scheduled user indices 
that sorts the channel gains in increasing order, i.e. < 

• • • < < • • • < ■ Then, the energy of the scheduled 

user ^k with rate is given by, m, eqi 

Ers>^ = ^ ) ( 2 ) 

'Theoretically, there is no limit on the number of users scheduled simul¬ 
taneously if there is no power constraint. In practice, the users have peak or 
average power constraints and apply finite modulation and coding schemes 
which limit the number of users scheduled simultaneously. 
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Fig. 1. The system model settings for the problem. The users make scheduling decisions independently based on the channel conditions and provide rate 
and channel state information to a central controller which provides the required power allocation information to the users for superposition coding. 


where Zq denotes the noise power spectral density. Fig. [T] 
shows the block diagram for the system settings where a 
central controller is responsible for providing the required 
power level information to each user for superposition coding. 

111. Modeling of the Proposed Scheme 

A constant arrival of a single packet with normalized size 
is assumed for simplicity. However, the scheme is not 
restricted to this assumption as a random packet arrival process 
can be modeled as a constant arrival process where multiple 
arrived packets in the same time slot are merged as a single 
packet with random packet size following the framework in 
0, E). The packet arrival occurs at the start of a time slot 
and the scheduling is performed afterwards taking into account 
the newly queued packet. All the arriving packets are queued 
sequentially, i.e., the oldest arrived packet is the head of line 
(HOL). If a single packet has to be scheduled or dropped, 
it has to be the HOL packet. Note that successive packet 
drop constraint inspires us to buffer and drop the packets 
sequentially (as compared to any random queuing strategy) 
because it is essential to maintain a sequence of the packets 
in the transmitter buffer. The newly arrived packet is the 
pointer to indicate how essential it is to transmit or drop a 
packet in relation to successive packet drop constraint while 
the scheduling of the other buffered packets is essential to 
maintain a ceratin average packet drop rate. 

The continuity constraint and buffer size parameters for a 
user k are denoted by N and B, respectively; and assumed to 
be identical for all the userfl The variables d < N and b < 
B denote the number of successively dropped and buffered 
packets for a user k at time t, respectively. A packet arriving at 
time t is not dropped immediately if not scheduled but buffered 
up to B time slots and dropped then (if still not scheduled). 

We use Markov decision process (MDP) to model and 
analyze the scheme which is a useful tool due to dependency 
of the dropped packets in relation to the successive packet 
drop sequences. The state space of the user is defined as 

A = {(P, H); I? e {0,..., iV}, 6 e {0,... S}} (3) 

^The framework can be generalized to non-identical N case for individual 
users where N follows a probability distribution, but this is out of scope of 
this work. 


where V and B denote the state space for successive packet 
drop and buffer states, respectively. Then, the state is defined 
as a composite variable p G A by the summation of the num¬ 
ber of (already) successively dropped and (already) buffered 
packets at time t such that 

p = d + b . (4) 


At the start of the Markov process (p = 0), the packet is not 
dropped if not scheduled as packets can be buffered for B 
time slots resulting in d = 0 and p = b for p < B. When the 
buffer is completely filled with packets, the unscheduled HOL 
packet is dropped onwards. Note that, the dropping operation 
is limited to a single packet as this is enough to make room 
for the newly arriveq^ packet at time t. Thus, the variable d 
increases and b is fixed to B forp > B. The maximum number 
of states in our Markov chain is B + N+1 where M = B + N 
denotes the termination state. 

Let Upq denote the transition probability from a state p to 
q with St representing the state at time t. Furthermore, we 
denote transition probabilities associated with the scheduling, 
buffering and dropping decisions by the notation a® , a'. 


^ and 


Qfpq, respectively. We define 


^pq 


= Br{St+i=q\St=p) = 


as 



Vp, q < min(p, B) 

Oi^ 

PQ 

p < B,q = p + 1 

PQ 

P > B,q = p + 1 

0 

otherwise 


( 5 ) 


where possible values of the states form the finite state space 
A of the MDP. 

We define a scheduling threshold. 


Definition 1 (Scheduling Threshold Kpq). It is defined as the 
minimum small scale fading value fk required to make a state 
transition from state p to q such that 

a'pq = Pi{Kpq < fk< Kp{q-1)) 0 < g < min(p, B), (6) 

where KpQ- is defined to be infinity with Sq- denoting a 
dummy state before Sq. 


Thus, the small scale fading must be greater than Kpq for the 
process to enter in state q where fading vector is quantized in 


^The newly arrived packet waits in a separate temporary buffer momentarily 
before the scheduling decision as it arrived at the start of the time slot. 
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non-overlapping intervals. The threshold definition uses fading 
instead of channel gain to avoid near-far effect which gives 
unfair advantage to the users near the base station. 

A. The Proposed Scheduling Scheme 

We assume an infinitely large number of users in the 
system for the analysis purpose. This assumption makes the 
analysis of the scheme tractable which is hard otherwise due 
to multiuser interactions. In the large user limit, the scheduling 
decisions of the users decouple and the multiuser system can 
be modeled as a single user system following the work in 
El, E). Every user makes his own scheduling decision 
independent of the other users. 

The purpose of the scheduling scheme is to maximize the 
use of available fading conditions by scheduling as many 
packets as possible. Thus, the fading vector is quantized in 
such a way that the discrete set of state-dependent scheduling 
thresholds determine the intervals for the optimal scheduling 
decisions and the size of this vector equals the number of 
packets available for scheduling in a state p. 

In a state p > q, the scheduler makes a state transition to 
state q for fading fk such that 

i^pq < fk < Kpi^q-i), 0 < q < mm{p, B) . (7) 

For a state transition with q < mm{p,B), the number of 
the scheduled packets is given by 

L{P, f) = min(p, B)-q + l , (8) 

where q is determined uniquely by ©. Obviously, a user 
can only schedule as many packets as buffered. Thus, the 
maximum scheduled packets for a state p < B are limited 
to p — q + 1 (due to constant arrival model) while they are 
fixed to B — q + 1 for p> B. 

Note that scheduling of packets starts with the HOL packet 
and ends with the most recently arrived packet. Given a state 
P, 0 chooses q which maximizes the average system reward. 
To meet the continuity constraint with probability one, kmb 
is set to zero to allow transmission of the HOL packet in state 
M, similar to the approach used in ll22ll . 

We deduce the following properties of the proposed schedul¬ 
ing scheme from 0. 

Property 1. The next state (in case of scheduling) is limited 
by the minimum of p and B. If p < B, q cannot exceed p, 
otherwise it is limited to B. 

Thus, up to min(p,i3) buffered packets and one newly ar¬ 
rived packet can be scheduled depending on small scale fading 
in a state p. As the quantity min(p, i?) is used extensively in 
the analysis, we denote it by p in the rest of the paper. 

Property 2. Scheduling thresholds in a state p follow the 
monotonic decrease property that 

i^pq < i^piq-1) , Vp, 0 < g < /r . (9) 

Property 3. For a state transition resulting in the scheduling 
of the same number of packets, 

Kpq < K(p_l)(,_l) , Vp,0 < q < p, . (10) 


The closer to termination state, the smaller the thresholds 
and the larger the energy expenditure. This follows from the 
classical optimal stopping theory formulation where expected 
incentive in waiting decreases monotonically fTlX . 

If / < Kpfj,, no scheduling occurs. In this case, the next 
state q equals p -1- 1 but a packet can be dropped or buffered 
depending on the conditions in (fTTT i and (fTSl i such that 

p 

ap(p+i) = Pr(/ < Kpp) = 1 - ^ , p<B . (11) 

Kpp denotes the minimum threshold for self transition to 
schedule at least one packet. If p < B, the unscheduled HOL 
packet is buffered with the option that it can be scheduled in 
one of the B — p time slots in future. 

B 

aiip+1) = Pr(/ < «pb) = 1 - XI “p9 ’ P>B, (12) 

where KpB denote the minimum thresholds to make transition 
to state B for scheduling at least one packet. If p > i3, the 
unscheduled HOL packet has to be dropped as the buffer is 
already full. The best option by continuity constraint point 
of view is to drop HOL packet to make room for the newly 
arrived packet. 

The state transition probabilities are computed as a function 
of scheduling thresholds via ®, E) and (fl^ . and depend 
on the fading distribution and QoS parameters {N,B,9tax)- 
The state transition probabilities constitute the state transition 
probability matrix Q for an MDP. 

To explain the computation of matrix Q, consider the state 
diagram in Fig. |2] for different values of N and B parameters. 
The corresponding matrix Q for a system with iV = 1 and 
S = 2 is given by 


/ 

“00 

0^01 

0 

0 

\ 


“10 

“11 

ai 2 

0 



“20 

“21 

^22 

0^23 


v 

0^30 

ail 

a 32 

0 

/ 


Appendix 0 shows the general relation between matrix Q 
and scheduling thresholds for a small scale fading distribution 

Pfiy)- 

It should be noted that the number of states in an MDP 
are the same for the parameter sets N = 2, B — 1 and N = 
1,B = 2 but the transition probability matrix Q differs and 
captures the effect of each parameter on the system energy. 
The parameter set N = 2, B = 1 requires optimization of 2 
thresholds per state for p > 1 while the parameter set N = 
1,B — 2 requires 3 thresholds per state forp > 2. We evaluate 
the energy efficiency of both cases numerically in Section Ivnl 


IV. Optimization Problem Formulation 


The number of scheduled packets are considered virtual 
users (VU) for the analysis purpose. It is known that the 
average energy consumption of the system per transmitted 
information bit at the large system limit K ^ oo is given 


by El, El 




2^ P/»,vu(a;) 


dP 


h,VU 


(x) 


(14) 
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(a) N=2, B=1 


(b) N=l, B=2 


Fig. 2. State transition diagrams for different buffer size and continuity constraint parameters. The green colored states represent the buffering states while 
grey state is the M*** state. The superscript notation with transition probabilities depict the actions associated with the transitions. 


where Pti,vu(') denotes the cumulative distribution function 
(cdf) of the fading of the scheduled VUs. It comprises of the 
small scale fading and the path loss components of the VUs. 
However, in the large system limit, the state transitions depend 
only on the small scale fading distribution as the path loss for 
VUs follows the same distribution as the path loss of the users. 

The optimization problem is to minimize system energy for 
the target average and successive packet drop constraints. We 
formulate the problem with the help of MDP model described 
in Section Hn] and optimize the state transition probabilities (or 
resulting Q) offline. 

Consider the following optimization problem: 


channel-dependent optimal scheduling thresholds can be com¬ 


puted from the optimized a* = [Upg,... usmg 4 yjr. 

( ^ ) in (O requires computation of channel distribu- 

tion for the scheduled users. From our MDP model for the 
problem, the probability density function (pdf) of the small 
scale fading of the scheduled VUs is computed as a function 
of the optimized scheduling thresholds by 


using dhjl. 


M 


P/.Vu(2/) = V CpL{p, y)TTp p/(y), Kpq < y < Kp{q-1) 


P—0 


(19) 


s.t. 





minqgn 

(^Osys 


:Q O^pm ^ 1 

0 <ap„<l,0: 

; 0 tar 


Q e u 

0 

P 

= 1 

0 < p < M 

N = 

M 

B < 00, N < 00 


where U denotes the set of permissible matrices for Q. Note 
that 0 tar is the target (maximum) average packet drop rate 
while 9r is the average packet drop rate resulting from our 
proposed scheme and any fixed Q according to (fThl) such that 


where Pfijj) and Cp denote the small scale fading distribution 
( 15 ^nd a normalization constant while L{p, y) is given by ®. The 
pdf represents the conditional probability of being in state p 
and transmitting L{p,y) packets for small sale fading y. 

The cdf of the VUs can be written as a sum of integrals 

(J6) 

M 

E' 

p—O 


M-l 


M -1 


Ck-r) 


M 

^ TTp = 1 . 

P—0 


( 20 ) 


(17) 


9r — ^2 Otp{p+l)'^P — E E 

p—B p—B m—0 

Equation (fTTl i results by combining Ci and Cg in (fThl) while TTp 
denotes the steady state probability of the state p and follows 
the property of homogenous Markov Chain that 


P/.vu(y) = '^CpTTp(^L{p,y) J 

P/(?)dC) 

p-q 

\L{p,y)Pfiy)-Y. 


fi-q 


6=1 




—b+ 1 ) 


M 

E' 

p =0 


■p'^p \ 


6=0 


(18) 


since no virtual users are scheduled for y < Kpp. The channel 
distribution for the scheduled VUs is computed in close form 
for Rayleigh fading and the path loss distribution in Appendix 


A. Heuristic Optimization 


In our MDP model, the forward transition for the state p > B 
represents the events of dropping the packet in state p and 
the summation over the corresponding transition probabilities 
ctp(p+i) gives the average dropping probability 9r. The sum¬ 
mation in (fTTI) starts from state B as the unscheduled packets 
are buffered for p < B. 

The constraint on average packet drop rate is modeled by C 2 
in ( fThb . The constraint on successive packet drop is modeled 
through C 4 while Ci and C 3 are the standard constraints on 
the transition probabilities. For a fixed p, the corresponding 


The optimization problem is to compute a set of transition 
probabilities that result in minimum system energy in (fT4l) . 
For every state p, an optimal a* = [a*g,... a*^] needs to be 
computed. The computation of optimal a* under constraints 
in is a stochastic optimization problem and requires 
heuristic optimization techniques like genetic algorithms, neu¬ 
ral networks, etc., which provide acceptable solutions with 
reasonable computational complexity. 

We choose Simulated Annealing (SA) to compute the so¬ 
lution for the optimization problem. SA is believed to help 
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avoiding local minima by probabilistically allowing a candi¬ 
date configuration to be the best known solution temporarily 
even if the configuration is not the best available solution at 
that time. This is called muting. The muting occurs at a faster 
rate at the start of the optimization process and decreases as the 
process goes on. This behavior is analogous to first heating at 
high temperature and then cooling the object. There are many 
cooling schedules used in literature, e.g., Boltzmann annealing 
(BA) and Fast annealing (FA) schedules, etc., 123, ll24l . We 
employ FA in this work. In FA, it is sufficient to decrease the 
temperature linearly in each step j such that. 


Tj = 


To 

t'saj T 1 


( 22 ) 


where Tq is a suitable starting temperature and Csa is a constant 
adjusted according to the requirements of the problem. 

Algorithm [T] shows the pseudocode for application of SA 
to our optimization problem at a single temperature iteration. 
The MDP models the problem for finite B, N while the matrix 
Q represents a candidate configuration for SA. In SA, one 
transition probability fulfilling Ci — C 4 (in ( fTbl l) in Q is varied 
randomly in a single iteration and the objective function, 
(^) Gil, is computed only if the constraint C 2 is 

satisfied. If the solution improves the previous best solution, 
the new configuration, i.e., the proposed Q, is selected as the 
current best solution, discarded otherwise. However, with a 
certain probability, the new matrix Q can be selected as the 
best solution even if it does not improve the previous best 
solution. This helps to avoid local minima as explained earlier. 
After n iterations, the temperature is lowered down according 
to (I 22 I 1 and the routine is repeated untill the temperature 
reaches its lower limit and the computed end solution is 
considered optimal. We omit the details of SA scheme here 
due to space limitations but the reader is referred to 
for the details of SA algorithm. 


V. Characterization of Buffer Size 

In this section, we characterize the effect of buffer size on 
system energy. For a fixed N, an increase in buffer size B 
causes increase in quantization levels for (per state) fading 
vector. Larger the quantization levels, the better the use of 
fading to improve energy consumption in our scheduling 
scheme. Technically, larger the buffer size, the more is the 
waiting time for a specific packet to wait for an optimal time 
slot to get scheduled. This result follows directly from the 
finite horizon optimal stopping theory. 

To characterize the effect of buffer size on our scheme, 
we compare two systems with equal N: a system with no 
buffering and a system with buffer size B. As M = N + 
B, denoting continuity constraint parameter for a zero buffer 
system by N implies M = N and GtI i reduces to 

N-l 

Qr — ^ ^ U^p(p-t-l)Tp . (23) 

p=0 

Eq. (I 23 I 1 for B = 0 is identical to GtI i for a system with 
i? > 0 in terms of number of states summed over as B is 
added in both upper and lower limits to get GtT i. The only 


Algorithm 1: SA Algorithm iterations for a single iteration 
at temperature T 

Input: (Q*,£’*,T,6ltar); 

E*= current minimum energy; 

Q*= current probability matrix solution ; 

/* Generation of n random Q for a 

temperature T. */ 

for i=0 to n do 

Generate a random Q and compute 9^ for Q ; 

/* Evaluating C2 */ 

if {9r < dtar) then 

Compute energy E as a function of Q-, 
if (Muting is true) then 

Q* = Q, 

/* Energy update step. */ 

if (.E < E*) then 
I E* =E\ 

end if 
end if 
end if 
end for 

return {E*,Q*); 


difference between the two systems emerges from the fact that 
a proportion of the packets for a buffering system is scheduled 
before reaching state B in MDP. Let Xb denote the effective 
rate of unscheduled packets entering state B (as compared to 
one in case of no buffering) and given by 

B — 1 p 

Ab = Q!(B-l)B7rB-l = 1-EE CT-pqttp (24) 

p=0 q=0 

The increased energy efficiency in the system with buffering 
is due to the packets scheduled in first B (0 to B — 1) states 
as explained in the following. 

1) The packets scheduled in first B states help to reduce 
the number of packets to be scheduled in the remaining 
N states while Star is the dropping rate for the packets 
entering the system. As a fraction 1 — Ab of the packets 
is scheduled before state B, a larger proportion of 
the remaining packets can be dropped in remaining 
N (B to M) states to meet Star as compared to the 
system without buffering which increases DoF and helps 
to reduce system energy. Thus, due to scheduling in 
additional B states, the resulting system energy for the 
dropping probability Star and buffer size B is equivalent 
to the system energy for the case B = 0 and dropping 
probability Star/AB- In numerical results, we quantify 
that the energy gain due to factor Star/AB provides a 
lower bound for the improvement as additional gain is 
achieved in scheduling in first B states as explained 
below. 

2) The packets scheduled in first B states result in smaller 
average E^/Nq as compared to average E^/No in the 
remaining N states. This follows from Property 3 and 
cni). An increasing scheduling threshold implies smaller 
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energy consumption in the scheduled packets in first B 
states. 

A. Limiting the Buffer Size 

In practice, the continuity constraint parameter values are of 
the order of a few tens. Our results in Section IVIII show that 
increasing the value of B for a fixed N results in a decrease of 
energy. Naturally, one would like to have a large value for B 
to maximize the energy gain. However, B cannot be increased 
indefinitely due to the following phenomenon. 

For a fixed N, the system energy saturates at some = 
Otar and 6 >tar > 6 *iim does not improve the energy efficiency 
(c.f. Lemma 1 in lfT3l as reviewed briefly in Appendix 0 
where is the solution of (fTsT i without applying C 2 in (fThl l. 
In the following, we study the effect of buffer size B on this 
particular parameter. We observe from the numerical results 
that increasing B for a fixed N shifts 0iii„ towards zero average 
dropping probability, i.e., increasing B further does not help 
to increase the system energy efficiency significantly. 

We deduce the following Lemma based on our evaluation. 

Lemma 1. There exists a value of B that maximizes the energy 
gain via buffering for a fixed N. It is independent of 0tar and 
denoted by Bm{N). 

Proof: We obtain Slim = 9r from the solution of the 
problem solely constrained by the continuity constraint, i.e., 
by solving (fTST l without constraint C 2 for forward transition 
probabilities for B < p < M — 1. As B increases with 

respect to N, the steady state probabilities itp for B < p < 
M — 1 decrease because a buffered packet gets B opportunities 
to exploit multiuser diversity before it gets dropped if not 
scheduled. From (fTTl i. depends on tip for B < p < M —1 
as well, i.e., 0iim(7rB, ttb+i, ..., ttm-i)- If one component 
ttb, TTs+i,..., ttm-i is decreasing, decreases for a fixed 
N i e., > 0 for B < ; < M - 1 . 

Eventually 0iini —0 for B/N >> 1. ■ 

Thus, parameter N constrains B via Lemma 1. However, 
this maximization is achieved at dn^ = 0 which implies that 
the value of fftar and N is irrelevant as it is not permissible 
to drop any packet regardless of N. Therefore, the value of 
B < Bjn{N) should be designed such that the DoF available 
from N and 0tar can be utilized effectively to improve the 
system energy efficiency in conjunction with parameter B. 

VI. SuBOPTiMAL Scheduling Schemes 

The computational complexity of the scheme depends on 
the number of quantization levels (thresholds) per state which 
in turn depend on buffer size. In practice, the buffer size is of 
the order of a few tens to hundreds. In this case, the ”Best’0 
scheduler presented in our work results in computational 
complexity (for the thresholds) of the order 0 {MB) and in 
the case B » N, it becomes 0{B'^). In the following, we 
propose suboptimal schedulers which reduce the complexity 

“^Please note that we avoid using term optimal as the solution of the scheme 
presented in Section HTT] cannot be proven optimal in the mathematical sense. 
To differentiate this scheme with low complexity schemes, we term the scheme 
as ’’Best” in the subsequent work. 


at the marginal energy loss. These schedulers exploit non- 
uniform distribution of transition probabilities in the original 
optimal scheme. For every state p, the original scheduling 
scheme is based on the idea of allowing scheduling of multiple 
packets for opportunistic use of good channels. 

However, the computational complexity can greatly be 
reduced by merging some of the transition probabilities in a 
smart way. All the suboptimal schemes share one property that 
scheduling of at least one packet per state must be facilitated 
to maximize the satisfaction of continuity constraint before 
reaching M*** state. Note that the forced transmission in M*** 
time slot results in large energy expenditure. 

A. One-Or-All Scheduler 

This scheduler is the simplest in terms of computational 
complexity and implementation. Instead of having the option 
of scheduling up to p -b 1 packets in a state p, the user is 
limited to schedule either one, /i-|-l or no packet at all, thereby 
this scheme is called One-Or-All (OOA) scheduling. All other 
transition probabilities are set to zero. The idea is motivated by 
the emptying buffer scheme in ii where a user either empties 
the buffer when scheduled or waits for the next time slot. 
The computational complexity for this scheme is 0{B) for 
B/N >> 1 as only 2B transition probabilities need to be 
optimized. 

Following the derivation in (ISTT i. the cdf of the VUs for 
OOA is given by 

M f (p + l)P/(j/)- 
P/.vu(y) = X! ^p'^'P f y > 

[Pfiy) - Kp/j. <y < Kpo 

(25) 

B. Selective State with Exponential Merging (SSE) 

As OOA scheduler quantizes every channel state into 2 
levels per state, the performance is expected to degrade rapidly 
as compared to the ’’Best” one. A tradeoff between OOA 
and the ’’Best” scheduler is Selective State with Exponential 
merging (SSE) scheduler with complexity 0{B log B). Out of 
the possible state transitions, one state transition is dedicated 
for allowing scheduling of a single packet to maximize the 
probability of meeting the continuity constraint after dropping 
N packets successively. Thus, the non-zero probability for the 
state transition from state p to p is a must for every state p. Eor 
the selection of other possible state transitions, we propose the 
following method where we choose thresholds exponentially. 

Eor a state p, we observe in transition probability matrix 
for the ’’Best” scheme that the state transition probabilities 
are quite high for state zero, state p and the states which are 
closer to zero. Therefore, other than (as in OOA), a natural 
choice of allowed transition probabilities is as follows. 

Eor a state p, select a vector of possible next states by 

q = [2° - 1,2^ _ i _ _ 2 L'°S 2 (m+i)J _ 1 ^^] (26) 

where the state transition from p to p is appended in the 
end only if it is not already contained by the vector, i.e.. 



2 L'°S 2 (m+i)J _ 1 ^ ^ xjie next states for the state transi¬ 
tions are more concentrated for the states with small q and 
sparsity increases exponentially following our observation for 
the transition probability matrix for the ’’Best” scheme. Please 
note that smaller q implies transmission of more packets and 
vice versa. The loss in merging states with large q is small as 
compared to merging states with small q. 

For this scheme, the corresponding state transition matrix 
Qsse for a system with N = 2 and B = 3 is given by 


/ 

“00 

“oi 

0 

0 

0 

0 

\ 


afo 

“11 

“12 

0 

0 

0 



0^20 

“21 

“22 

“23 

0 

0 



“30 

“31 

0 

“33 

“^4 

0 



“40 

“41 

0 

“43 

0 



V 

“io 

“51 

0 

“53 

0 

0 

/ 



The general closed form expression for computation of 
VUs’ channel distribution in terms of p is not straight forward 
due to its dependency on vector q. For every p in matrix 
Qsse, the general form of the expression varies and can be 
derived using the derivation technique explained in Section Hill 
for the ’’Best” scheme. Thus, for every p, corresponding q is 
computed using (l26l l which determines the channel distribution 
as follows: 

The channel distribution of VUs for SSE is conditioned on 
state p (as before) and given by 

M 

P/,vu( 2 /) = Y,c,n,Pf{y\p) (28) 

p —0 

where Pf{y\p) in turns is conditioned on q such that: 

For 1^ = 1, 


P/(2/b) =P/(j/)-P/( kpm) y>^po 


For 1^ = 2, 


(29) 


{ {y + 1)P f{y)-Pf{Kpo) y>Kpo 

-P/(Kpi) 

P/(j/)-P/( l^p/i ) Kpp, <y <Kpo 


For 1^ = 3, 


(30) 


{p+l)Pf{y)-Pf{Kpo) y>Kpo 

“ l)P/(^pi) ~ P/(^pm) 

P/(yb) = yPfiy) - (m - i)P/(Kpi) Hpi <y < ^po 

— Pf^Kpp,) 

P/(j/) - P/(«^Pp) Kpp<y < Kpi 

(31) 

Following the proposed framework, channel distribution can 
be calculated for all \q],p. 


VII. Numerical Evaluation 

In this section, we provide some numerical examples to 
demonstrate the potential gain of our scheme. K users are 
uniformly distributed in a circular cell except a forbidden 
region of radius 5 = 0.01 around the access point and the path 
loss follows the distribution in lfT3l . iflSl . We assume Rayleigh 


Fig. 3. The system energy as a function of average packet dropping 
probability and buffer size parameters for a fixed N. The values for 
have been pointed with elliptical shapes for every curve. 


fading with mean one and the path loss is exponential with 
exponent 2. The value of C is fixed to 0.5 bits/s/Hz. 

Eirst, we present an example of numerically computed 
Qbst matrix for the ’’Best” scheme. The matrix is a function 
of scheduling thresholds as computed in Appendix Eor 
example, we have the following matrix for N = 2, B — 1 
and 0 tar = 0 . 02 . 


/ 

0.69 

0.31 

0 

0 \ 


0.76 

0.17 

0.07 

0 


0.67 

0.27 

0 

0.06 

V 

0.64 

0.36 

0 

0 / 


We show system energy against target average packet drop 
probability in Eig. |3] and focus on the effects of buffering 
on a system constrained by parameters 9ta.r,N. For a fixed 

N, the system becomes more energy efficient as B increases 
due to the effects characterized in Section |V] For the case 
B = 1,N = l, 6 >tar = 0.02, the parameter 0tar/A_B equals 

O. 09 from the matrix in ( l32l i. This implies a system with 
buffer size one and 0 tar = 0.02 is more energy efficient 
than a system with B = 0 and fftar = 0.09 as evident 
from Fig. [3 Thus, flexibility in latency requirements helps to 
combat the transmission challenges emerging from the finite 
packet dropping parameters. We observe also that the energy 
expenditure for the parameters B = 2, N = 1 is substantially 
smaller than the case with parameters B = 1, N = 2 (where 
M — 3 for both cases) at small 0tar, but the opposite holds at 
large dtar- Thus, it is important to realize the operating region 
for the system to maximize the advantage from DoFs. 

As evident from Fig. |3 an increase in value of B results 
in decrease in the value of dum as explained in Lemma 1. In 
this particular numerical example, Bm{N) = 6 for N = 1. 
However, allowing B = 6 is not beneficial from the energy 
point of view as no packet loss tolerance is exploited. If 
the system is loss tolerant in terms of average packet drop 
rate, B < Bm{N) can be designed to maximize the energy 
efficiency without wasting extra buffer (cost). For example. 
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0.05 


03 

■D 


(a) 6>tar = 0 


(b) 0tar = 0.05 



Fig. 4. The system energy as a function of parameters S, N and Otar in a 3-dimensional illustrations. 



0 12 3 

B 


Fig. 5. The accuracy measure A[%] as a function of buffer size and continuity 
constraint parameters while Otar = 0.05 for all simulations. The number of 
temperature iterations are 100 while 50 * (M + 1) random configurations of 
Q are simulated at one temperature. The value of initial temperature To can 
be fixed according to the problem requirements. 


for Star = 0.05 and N = 1, B = 3 realizes nearly the same 
energy as for the cases B > 3 at the reduced cost of buffering. 

Fig. a exhibits the effects of different combinations of 
parameter sets B,N and Star on the system energy in a 
3-dimensional plot where the effect of each parameter in 
different operating regions is evident. Fig. |4(a)| for parameter 
Star = 0 is a special case where N becomes irrelevant as zero 
average packet drop rate implies that the system is lossless 
and thus, N > 0 does not help (as no packet can be dropped). 
However, if Star = 0, an increase in the value of B does 
help to make the system energy efficient as shown in Fig. 
|4(b)| A system with parameters B = 2,N = l,Star = 0 is 
almost as energy efficient as a system with i? = 1, = 1 

and Star — 0.05, i.e., with an additional freedom in average 
dropping probability. 

To measure the relative accuracy of the computed solution 


TABLE I 

Comparison of Schemes eor N = 3, B = 3, C = 0.5 bits/s/Hz 


Scheme 

Complexity 

Eb/No(9tar =0.1) 

’’Best” 

0{B^) 

-5.92 dB 

OOA 

Op) 

-5.83 dB 

SSE 

0{B log B) 

-5.86 dB 


for the SA algorithm, we define a parameter A by 
^ ^ ■ min(Strr,Surrr) 

where S* is computed for a given Star by using dnli for 
the ’’Best” solution Q*. This measure specifies how close S* 
for the solution to the targeted maximum value of average 
packet drop is. Though, it is not true always that the smaller 
A guarantees more energy efficiency as there exist some 
combinations of transition probabilities which result in smaller 
Sr but larger energy. A small A implies that the computed 
solution is sufficiently exploiting the DoF inherited by the 
system through parameter Star- 

Fig. 0 shows A for different combinations of parameters 
B and N. We observe that A is quite small for the com¬ 
puted solutions for the ’’Best” scheme. As SA is a heuristic 
algorithm, there is no consistent pattern in the values of A. 
In general, for a fixed number of temperature iterations, the 
computation of the solution is expected to be hard as the 
number of parameters involved increases, i.e., dense Q and 
large number of thresholds. If A is large for a parameter set, 
the number of iterations are increased to improve the solution. 

Table U illustrates the comparison between the ’Best’ and 
low complexity sub optimal schemes. Note that all the schemes 
fulfill the continuity and average packet drop constraints 
effectively. As expected, OOA is the least complex scheme 
but energy loss is substantial as compared to the ’’Best” one 
at 0tar = 0.10. The SSE scheme is more complex than OOA; 
but improves energy performance over OOA while energy loss 
is minimal as compared to the ’’Best” one. 

Fig. |6] compares the system energy and achieved average 
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(a) Eij/Nq for the ’’Best” and OOA schemes for Rayleigh fading distribution. (b) Or for the ’’Best” and OOA schemes for Rayleigh fading distribution. 

Fig. 6. The comparison of the ’’Best” scheme against suboptimal low complexity OOA scheme. The simulation parameters are B = 3, N = 3. Note that 
the curves for both schemes with 200 iterations per temperature value are overlapping with OOA curve with 100 iterations and are hardly distinguishable. 


packet drop rate 0^. for the ’’Best” and OOA schemes against a 
target average dropping probability. At low Star both schemes 
are indistinguishable as DoF provided by Star for N = 3 
and B = 3 is too small to be capitalized effectively by any 
scheme. As Star increases, the ’’Best” scheme shows better 
results than OOA as expected. However, for large Star. OOA 
outperforms the ’’Best” one surprisingly. This is attributed to 
the fact that reduced complexity (less number of thresholds) of 
OOA helps to compute more accurate matrix Qqoa (via SA) 
as compared to the ’’Best” scheme where Q^st i^ dense. This 
effect is more pronounced at large Star due to more freedom 
in choosing transition probabilities in matrix Q*. 

To verify our observations numerically, we repeat the 
optimization by increasing number of iterations per single 
temperature from 100 to 200. We also plot the achieved 
Sr for both schemes in Fig. | 6 (b)| Initially, S^ follows Star 
closely for both schemes. It is clear that 9r for OOA is much 
closer to Star in the region when OOA outperforms the ’’Best” 
scheme. At Star = 0.15, increase in number of iterations in 
SA algorithm from 100 to 200 improves accuracy of Q* (and 
thus system energy in Fig. | 6 (a)| i for the ’’Best” scheme while 
it has negligible effect on OOA scheme as Sr for OOA was 
already close to Star- Thus, we conclude that the proposed low 
complexity schemes perform very close to the ’’Best” scheme 
and extensive fading vector quantization levels for the ’’Best” 
scheme do not help much. 

VIII. Conclusion 

We investigate the tradeoff between the system energy of 
multiuser multi-access system and the packet drop tolerance of 
the applications characterizing the network traffic. In contrast 
to common approach of dropping a packet as a consequence 
of failing to provide a required rate to the users, we propose 
maximizing the use of packet drop tolerance by dropping as 
many packets as permissible without compromising the QoE 
for the users. The joint constraint on permissable average 
and successive packet drop poses interesting challenge in 


the optimization problem. We propose a packet scheduling 
scheme and analyze it using MDP under large user limit. 
As the formulated optimization problem is non-convex for 
a multiuser system, the heuristic solution is presented. We 
also propose suboptimal low complexity schemes which show 
negligible energy loss as compared to the proposed ’’Best” 
scheme. The numerical results evaluate trade-offs between the 
system energy and QoS parameters. The study reveals that 
system energy is influenced by different parameters in different 
operating regions and it is important to quantify the effect of 
each parameter to opportunistically make use of the channel 
for an energy efficient system design. In future work, we 
consider the proposed framework in more realistic scenarios 
when CSI is not available and only statistical guarantees can 
be provided on successive packet loss. 

Appendix A 

Relation between Thresholds and Transition 
Probabilities 

There is one to one mapping between transition probabilities 
and thresholds. For a given set of parameters B, N and small 
scale fading distribution P/(y), the transition probabilities are 
a function of the scheduling thresholds. Therefore, computing 
a set of thresholds is equivalent to the computation of a set of 
transition probabilities. 

For a fixed M = B + N (assuming B > 1), the transition 
probability matrix Qbst for the ’’Best” scheme in is expressed 


as. 

/ Pr{y > Koo) 

Pr{y < Koo) 

• ° \ 

Qbst — 

Pi(y > Kio) 

Pr(Kii <y < Kio) 

• 0 


\ Pr(y > kmo) 

Pr(KMi <y < Kmo) ■ ■ 

• 0 j 


where zero transition probability represents the impossible 
transition. 
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Appendix B 
Channel Modeling 

In this work the channel model of ifTSl . flSl is used. Signal 
propagation is characterized by a distance dependent path loss 
factor and a frequency-selective short-term fading that depends 
on the scattering environment around the user terminal. As 
described in SectionHIl these two effects are taken into account 
by letting hk = Skfk- 

As in ESI, we assume that users are uniformly distributed 
in a geographical area but for a forbidden circular region of 
radius 6 centered around the base station where 0 < (5 < 1 
is a fixed system constant. Using this model, the cdf of path 
loss is given by 

r 0 a: < 1 

F«(^) = S 1 - l<a:<5-“ . (35) 

[ 1 X > S~°‘ 

where the path loss at the cell border is normalized to one. 

For a Rayleigh channel, the distribution of small scale block 
fading is given by 


P/( 2 /) = 1 - exp(-j/) (36) 


Ph{x) is defined as the cdf of the random variable hk = Skfk- 
Recall from (I 2 TI) . the cdf P/_vu( 2 /) of VUs for the ’’Best” 
scheme is a weighted function of the cdf of actual fading 
Pf{y) given by (l36l) . Using (l 2 Tli and (l35l) . we compute a 
convenient expression for the cdf P/i,vu(a^) of the VUs for 
this product channel. As path loss and Rayleigh fading occur 
simultaneously and independently, the cdf of the channel gain 
is given by 


Ph,vv{x) = J Fs{x/y)dPf^y\j{y). (37) 

Using path loss distribution in dlST l. (IJTI) is computed as 
follows 


pXO nX 

Ph,vv{x)= / P/.vu(y)dt/-f / Fs(a;/t/)dP/,vu(y) (38) 
Jo J x 5 °^ 

Following the derivation in ED, changing variables and inte¬ 
grating by parts yields, 

Pavu(^) = P/,vu(l/“/^)dy . (39) 


For a = 2, ( |39] | can be written in closed form. 

For the ’’Best” scheduler, using (ItTIi and the Rayleigh fading 
model, (l3^ is given by 


P/i,vu(a;) = 


1 


M 

X 

P —0 

p-q 


Crt'TTr) 


Lip, y) 


s(l - 5"^) JxS‘^ 

(1 - exp(-?/)) - ^(1 - exp(-Kp(^_b))) 

Integrating dTIT l yields 


b=0 


(40) 

dy . 


M ^ 

\yvix) = CpTTp Lip, y) (^1 - 1 - (exp(-a;) 


p—0 


p-q 


- exp(-a;^^)) ) - X! “ exp(-Kp(^_b))) 


b=0 


(41) 


Similarly, the channel distribution of VUs for OOA and SSE 
schedulers can be computed using ( l25l l and ( l28l l, respectively 
in (l39l). 


Appendix C 

Review of Lemma in ifTJIl 

The behaviour of system energy for a fixed N and large 
0tar has been characterized by a Lemma in ifTSl . We review 
the Lemma briefly here to keep this work independent and 
self-sufficient. The lemma states: 

Lor a fixed continuity constraint parameter TV, there exists 
a finite such that for all 0tar > the same maximum 
energy efficiency is achieved as for more restrictive 

We evaluate E21l from Q* to get the resulting average 
dropping probability 9*. Lor a fixed TV, we will not be able 
to achieve further energy efficiency by dropping more packets 
for any 0tar > ^lim- The 9* value for the special case is termed 
as limiting average dropping probability and denoted by 0iii„ 
when we have no average packet dropping constraint but only 
continuity constraint. Equivalently, 0iii„ is the solution of (fTsT i 
without applying C 2 in ESJ. 
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